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SEQUENCE LISTING 



in 

^3 



<110> Strathmann AG & Co . 

<120> Virus-Vaccine 

<130> P057760 

<140> 
<141> 

<150> 199 07 485.2 
<151> 1999-02-12 

<160> 12 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 9709 
<212> DNA 

<213> Human immunodeficiency virus 



<400> 1 

tggaagggct 

cacaaggc ta 

tgacctttgg 

aataaggaga 

agggagaagt 

age tgcatcc 

ctggggactt 

gc tacatata 

gectgggage 

tgagtgctca 

agaccctttt 

cgaaagtaaa 

caagaggega 

aggagagaga 

aaaattcggt 

agcagggagc 

agacaaatac 

ttatataata 

aaggaagee t 

gcagcagctg 

ctccaggggc 

gtagtagaag 

ggagccaccc 

atgcaaatgt 

gtgcatgcag 

ggaac tac ta 

gtaggagaaa 

agece tacca 

gaccgattc t 

acagaaacc t 

ggaccaggag 

cataaagcaa 

atacagaaag 

gaagggcaca 

aaggaaggac 

tggccttccc 

ccaccagaag 

ccgatagaca 

tegtcacaat 

atacagtatt 

ttggaggttt 

aagctatagg 

tgac tcagat 

aat taaagee 



aatttggtcc 
cttccctgat 
atggtgcttc 
gaagaacagc 
attagtgtgg 
ggagtac tac 
tccagggagg 
ageagctget 
tctc tggcta 
aagtagtgtg 
agtcagtgtg 
gecagaggag 
ggggeggega 
tgggtgcgag 
taaggecagg 
tagaacgat t 
tgggacagct 
caatagcagt 
tagataagat 
acacaggaaa 
aaatggtaca 
agaaggcttt 
cacaagattt 
taaaagagac 
ggcctattgc 
gtaccc ttca 
tc ta taaaag 
gcattctgga 
ataaaac tc t 
tgttggtcca 
cgacac taga 
gagttttggc 
gcaa tt ttag 
tagecaaaaa 
accaaatgaa 
acaagggaag 
agagc ttcag 
aggaactgta 
aaagataggg 
agaagaaatg 
tatcaaagta 
tacagtatta 
tggc tgcact 
aggaatggat 



caaaaaagac 
tggcagaac t 
aagttagtac 
ttgttacacc 
aagtt tgaca 
aaagac tgc t 
tgtggcctgg 
ttttgcctgt 
ac tagggaac 
tgcccgtctg 
gaaaatctc t 
atctc tcgac 
ctggtgagta 
agegteggta 
gggaaagaaa 
cgcagt taat 
acaaccatcc 
cc tctattgt 
agaggaagag 
caacagccag 
teaggecata 
cagcccagaa 
aaataccatg 
catcaatgag 
accaggccag 
ggaacaaata 
atggataatc 
ca taagacaa 
aagagecgag 
aaatgegaac 
agaaatgatg 
tgaagcaatg 
gaaccaaaga 
ttgeagggee 
agattgtac t 
gecagggaat 
gtttggggaa 
tcctt tagct 
gggcaattaa 
aatttgccag 
ggacagtatg 
gtaggaccta 
ttaaat tttc 
ggcccaaaag 



aagagatcc t 
acacaccagg 
cagt tgaacc 
c tatgageca 
gec tec tagc 
gacatcgagc 
gcgggactgg 
actgggtctc 
ccac tgc t ta 
ttgtgtgact 
agcagtggcg 
gcaggactcg 
cgccaaaaat 
ttaagcgggg 
caatataaac 
cc tggee ttt 
cttcagacag 
gtgea tcaaa 
caaaacaaaa 
gtcagccaaa 
tcacc tagaa 
gtaataccca 
c taaacacag 
gaagctgcag 
atgagagaac 
ggatggatga 
ctgggattaa 
ggaccaaagg 
caagc t tcac 
ccagattgta 
acagcatgtc 
agecaagtaa 
aagac tgtta 
cc taggaaaa 
gagagacagg 
tt tc ttcaga 
gagacaacaa 
tccc tcagat 
aggaagctct 
gaagatggaa 
atcagatact 
cacc tgtcaa 
ccat tagtcc 
t taaacaa tg 



tgatctgtgg 
gecagggate 
agagcaagta 
gcatgggatg 
atttegtcac 
tttctacaag 
ggagtggcga 
tctggttaga 
agee tcaata 
ctggtaac ta 
cccgaacagg 
gettgetgaa 
tttgac tagc 
gagaattaga 
taaaacatat 
tagagacatc 
gatcagaaga 
ggatagatgt 
gtaagaaaaa 
attaccc tat 
c tttaaatgc 
tgttt tcagc 
tggggggaca 
aatgggatag 
caaggggaag 
cacataa tec 
ataaaatagt 
aaccc 1 1 tag 
aagaggtaaa 
agac tattt t 
agggagtggg 
caaatccagc 
agtgt ttcaa 
agggctgttg 
ctaatttttt 
gcagaccaga 
ctccc tctca 
cac tc tttgg 
attagataca 
accaaaaatg 
catagaaatc 
cataa ttgga 
tat tgagac t 
geca ttgaca 



ate taccaca 
agatatccac 
gaagaggeca 
gaggaccegg 
atggcccgag 
ggactttccg 
gccctcagat 
ccagatc tga 
aagcttgect 
gagatccc tc 
gac ttgaaag 
gcgcgcacgg 
ggaggctaga 
taaatgggaa 
agtatgggca 
agaaggctgt 
ac t tagatca 
aaaagacacc 
ggcacagcaa 
agtgcagaac 
atgggtaaaa 
at tatcagaa 
tcaagcagcc 
attgeatcca 
tgacatagca 
ace ta tccca 
aagaatgtat 
agac tatgta 
aaattggatg 
aaaagcat tg 
gggacccggc 
taccataatg 
ttgtggcaaa 
gaaatgtgga 
agggaagatc 
gccaacagcc 
gaagcaggag 
cagcgacccc 
ggagcagatg 
atagggggaa 
tgeggacata 
agaaatc tgt 
gtaccagtaa 
gaagaaaaaa 



60 

120 

180 

240 

300 

360 

420 

480 . 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100- 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 




• * taaaagcatt agtagaaatt tgtacagaaa 

ggcctgaaaa tccatacaat actccagtat 
ggagaaaatt agtagatttc agagaactta 
aattaggaat accacatcct gcagggttaa 
tgggcgatgc atatttttca gttcccttag 
ccatacctag tataaacaat gagacaccag 
agggatggaa aggatcacca gcaatattcc 
ttagaaaaca aaatccagac atagtcatct 
ctgacttaga aatagggcag catagaacaa 
ggtggggatt taccacacca gacaaaaaac 
gttatgaact ccatcctgat aaatggacag 
gctggactgt caatgacata cagaaattag 
atgcagggat taaagtaagg caattatgta 
aagtagtacc actaacagaa gaagcagagc 
aagaaccggt acatggagtg tattatgacc 
agcaggggca aggccaatgg acatatcaaa 
caggaaaata tgcaagaatg aagggtgccc 
cagtacaaaa aatagccaca gaaagcatag 
tacccataca aaaggaaaca tgggaagcat 
ttcctgagtg ggagtttgtc aatacccctc 
^ aagaacccat aataggagca gaaactttct 

:= J aattaggaaa agcaggatat gtaactgaca 

■3 acacaacaaa tcagaagact gagttacaag 

==0 tagaagtaaa catagtgaca gactcacaat 

ataagagtga atcagagtta gtcagtcaaa 
l s l tctacctggc atgggtacca gcacacaaag 

tggtcagtgc tggaatcagg aaagtactat 
aacatgagaa atatcacagt aattggagag 
□1 tagtagcaaa agaaatagta gccagctgtg 

=J3 atggacaagt agactgtagc ccaggaatat 

.*~ aagttatctt ggtagcagtt catgtagcca 

^ cagagacagg gcaagaaaca gcatacttcc 

W aaacagtaca tacagacaat ggcagcaatt 

CO ggtgggcggg gatcaagcag gaatttggca 

tagaatctat gaataaagaa ttaaagaaaa 
| as% atcttaagac agcagtacaa atggcagtat 

!:p ttggggggta cagtgcaggg gaaagaatag 

Q aagaattaca aaaacaaatt acaaaaattc 

j«* gagatccagt ttggaaagga ccagcaaagc 

tacaagataa tagtgacata aaagtagtgc 
atggaaaaca gatggcaggt gatgattgtg 
tggaaaagat tagtaaaaca ccatatgtat 
agacatcact atgaaagtac taatccaaaa 
gatgctaaat tagtaataac aacatattgg 
ttgggtcagg gagtctccat agaatggagg 
gacctagcag accaactaat tcatctgcac 
agaaatacca tattaggacg tatagttagt 
aaggtaggat ctctacagta cttggcacta 
ccacctttgc ctagtgttag gaaactgaca 
aagggccaca gagggagcca tacaatgaat 
gtgaagctgt tagacatttt cctaggatat 
aaacttacgg ggatacttgg gcaggagtgg 
tgtttatcca tttcagaatt gggtgtcgac 
gagcaagaaa tggagccagt agatcctaga 
cctaaaactg cttgtaccaa ttgctattgt 
ttcatgacaa aagccttagg catctcctat 
gctcatcaga acagtcagac tcatcaagct 
atgcaaccta taatagtagc aatagtagca 
gtgtggtcca tagtaatcat agaatatagg 
ttaattgata gactaataga aagagcagaa 
tcagcacttg tggagatggg ggtggaaatg 
ctgtagtgct acagaaaaat tgtgggtcac 
agcaaccacc actctatttt gtgcatcaga 
tgtttgggcc acacatgcct gtgtacccac 
aaatgtgaca gaaaatttta acatgtggaa 
tataatcagt ttatgggatc aaagcctaaa 
tagtttaaag tgcactgatc tgaagaatga 
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tggaaaagga aggaaaaatt tcaaaaattg 2700 
ttgccataaa gaaaaaagac agtactaaat 2760 
ataagagaac tcaagatttc tgggaagttc 2820 
aacagaaaaa atcagtaaca gtactggatg 2880 
ataaagactt caggaagtat actgcattta 2940 
ggattagata tcagtacaat gtgcttccac 3000 
agtgtagcat gacaaaaatc ttagagcctt 3060 
atcaatacat ggatgatttg tatgtaggat 3120 
aaatagagga actgagacaa catctgttga 3180 
atcagaaaga acctccattc ctttggatgg 3240 
tacagcctat agtgctgcca gaaaaggaca 3300 
tgggaaaatt gaattgggca agtcagattt 3360 
aacttcttag gggaaccaaa gcactaacag 3420 
tagaactggc agaaaacagg gagattctaa 3480 
catcaaaaga cttaatagca gaaatacaga 3540 
tttatcaaga gccatttaaa aatctgaaaa 3600 
acactaatga tgtgaaacaa ttaacagagg 3660 
taatatgggg aaagactcct aaatttaaat 3720 
ggtggacaga gtattggcaa gccacctgga 3780 
ccttagtgaa gttatggtac cagttagaga 3840 
atgtagatgg ggcagccaat agggaaacta 3900 
gaggaagaca aaaagttgtc cccctaacgg 3960 
caattcatct agctttgcag gattcgggat 4020 
atgcattggg aatcattcaa gcacaaccag 4080 
taatagagca gttaataaaa aaggaaaaag 4140 
gaattggagg aaatgaacaa gtagatgggt 4200 
ttttagatgg aatagataag gcccaagaag 4260 
caatggctag tgattttaac ctaccacctg 4320 
ataaatgtca gctaaaaggg gaagccatgc 4380 
ggcagctaga ttgtacacat ttagaaggaa 4440 
gtggatatat agaagcagaa gtaattccag 4500 
tcttaaaatt agcaggaaga tggccagtaa 4560 
tcaccagtac tacagttaag gccgcctgtt 4620 
ttccctacaa tccccaaagt caaggagtaa 4680 
ttataggaca ggtaagagat caggctgaac 4740 
tcatccacaa ttttaaaaga aaagggggga 4800 
tagacataat agcaacagac atacaaacta 4860 
aaaattttcg ggtttattac agggacagca 4920 
tcctctggaa aggtgaaggg gcagtagtaa 4980 
caagaagaaa agcaaagatc atcagggatt 5040 
tggcaagtag acaggatgag gattaacaca 5100 
atttcaagga aagctaagga ctggttttat 5160 
ataagttcag aagtacacat cccactaggg 5220 
ggtctgcata caggagaaag agactggcat 5280 
aaaaagagat atagcacaca agtagaccct 5340 
tattttgatt gtttttcaga atctgctata 5400 
cctaggtgtg aatatcaagc aggacataac 5460 
gcagcattaa taaaaccaaa acagataaag 5520- 
gaggacagat ggaacaagcc ccagaagacc 5580 
ggacactaga gcttttagag gaacttaaga 5640 
ggctccataa cttaggacaa catatctatg 5700 
aagccataac aagaattctg caacaactgc 5760 
atagcagaat aggcgttact cgacagagga 5820 
ctagagccct ggaagcatcc aggaagtcag 5880 
aaaaagtgtt gctttcattg ccaagtttgt 5940 
ggcaggaaga agcggagaca gcgacgaaga 6000 
tctctatcaa agcagtaagt agtacatgta 6060 
ttagtagtag caataataat agcaatagtt 6120 
aaaatattaa gacaaagaaa aatagacagg 6180 
gacagtggca atgagagtga aggagaagta 6240 
gggcaccatg ctccttggga tattgatgat 6300 
agtctattat ggggtacctg tgtggaagga 6360 
tgctaaagca tatgatacag aggtacataa 6420 
agaccccaac ccacaagaag tagtattggt 6480 
aaacgacatg gtagaacaga tgcatgagga 6540 
gccatgtgta aaattaaccc cactctgtgt 6600 
tactaatacc aatagtagta gcgggagaat 6660 




gataatggag aaaggagaga taaaaaactg 
taaggtgcag aaagaatatg cattctttta 
cagctacagg ttgataagtt gtaacacctc 
ctttgagcca attcccatac attattgtgc 
taataagacg ttcaatggaa caggaccatg 
tggaatcagg ccagtagtat caactcaact 
tgtagtaatt agatctgcca atttcacaga 
cacatctgta gaaattaatt gtacaagacc 
ccagagggga ccagggagag catttgttac 
acattgtaac attagtagag caaaatggaa 
aagagaacaa tttggaaata ataaaacaat 
agaaattgta acgcacagtt ttaattgtgg 
actgtttaat agtacttggt ttaatagtac 
aggaagtgac acaatcacac tcccatgcag 
agtaggaaaa gcaatgtatg cccctcccat 
tactgggctg ctattaacaa gagatggtgg 
acctggagga ggcgatatga gggacaattg 
aaaaattgaa ccattaggag tagcacccac 
aaaaagagca gtgggaatag gagctttgtt 
tatgggctgc acgtcaatga cgctgacggt 
" s i gcagcagcag aacaatttgc tgagggctat 

^ agtctggggc atcaaacagc tccaggcaag 

tcaacagctc ctggggattt ggggttgctc 
Ms ttggaatgct agttggagta ataaatctct 

is| ggagtgggac agagaaatta acaattacac 

i'T gcaaaaccag caagaaaaga atgaacaaga 

{ !Z gtggaattgg tttaacataa caaattggct 

y] aggaggcttg gtaggtttaa gaatagttt t 

kQ gcagggatat tcaccattat cgtttcagac 

gcccgaagga atagaagaag aaggtggaga 
. SS a gaacggatcc ttagcactta tctgggacga 

ccgcttgaga gacttactct tgattgtaac 
W gtgggaagcc ctcaaatatt ggtggaatct 

M» tagtgctgtt aacttgctca atgccacagc 

tatagaagta ttacaagcag cttatagagc 
:=S gggcttggaa aggattttgc tataagatgg 

^ gatggcctgc tgtaagggaa agaatgagac 

cagtatctcg agacctagaa aaacatggag 
atgctgcttg tgcctggcta gaagcacaag 
ctcaggtacc tttaagacca atgacttaca 
aagaaaaggg gggactggaa gggctaattc 
tgtggatcta ccacacacaa ggctacttcc 
gggtcagata tccactgacc tttggatggt 
aggtagaaga ggccaataaa ggagagaaca 
gaatggatga ccctgagaga gaagtgttag 
atcacgtggc ccgagagctg catccggagt 
acaagggact ttccgctggg gactttccag 
ggcgagccct cagatgctgc atataagcag 
ttagaccaga tctgagcctg ggagctctct 
caataaagct tgccttgagt gcttcaagta 
aactagagat ccctcagacc cttttagtca 
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ctctttcaat atcagcacaa gcataagaga 6720 
taaacttgat atagtaccaa tagataatac 6780 
agtcattaca caggcctgtc caaaggtatc 6840 
cccggctggt tttgcgattc taaaatgtaa 6900 
tacaaatgtc agcacagtac aatgtacaca 6960 
gctgttaaat ggcagtctag cagaagaaga 7020 
caatgctaaa accataatag tacagctgaa 7080 
caacaacaat acaagaaaaa gtatccgtat 7140 
aataggaaaa ataggaaata tgagacaagc 7200 
tgccacttta aaacagatag ctagcaaatt 7260 
aatctttaag caatcctcag gaggggaccc 7320 
a gggg aattt ttctactgta attcaacaca 7380 
tt: gg a g tact gaagggtcaa ataacactga 7440 
aataaaacaa tttataaaca tgtggcagga 7500 
cagtggacaa attagatgtt catcaaatat 7560 
taataacaac aatgggtccg agatcttcag 7620 
gagaagtgaa ttatataaat ataaagtagt 7680 
caaggcaaag agaagagtgg tgcagagaga 7740 
ccttgggttc ttgggagcag caggaagcac 7800 
acaggccaga caattattgt ctgatatagt 7860 
tgaggcgcaa cagcatctgt tgcaactcac 7920 
aatcctggct gtggaaagat acctaaagga 7980 
tggaaaactc atttgcacca ctgctgtgcc 8040 
ggaacagatt tggaataaca tgacctggat 8100 
aagcttaata cactccttaa ttgaagaatc 8160 
attattggaa ttagataaat gggcaagttt 8220 
gtggtatata aaattattca taatgatagt 8280 
tgctgtactt tctatagtga atagagttag 8340- 
ccacctccca atcccgaggg gacccgacag 8400 
gagagacaga gacagatcca ttcgattagt 8460 
tctgcggagc ctgtgcctct tcagctacca 8520 
gaggattgtg gaacttctgg gacgcagggg 8580 
cctacagtat tggagtcagg aactaaagaa 8640 
catagcagta gctgagggga cagatagggt 8700 
tattcgccac atacctagaa gaataagaca 8760 
gtggcaagtg gtcaaaaagt agtgtgattg 8820 
gagctgagcc agcagcagat ggggtgggag 8880 
caatcacaag tagcaataca gcagctaaca 8940 
aggaggaaga ggtgggtttt ccagtcacac 9000 
aggcagctgt agatcttagc cactttttaa 9060 
actcccaaag aagacaagat atccttgatc 9120 
ctgattggca gaactacaca ccagggccag 9180 
gctacaagct agtaccagtt gagccagata 9240 • 
ccagcttgtt acaccctgtg agcctgcatg 9300 
agtggaggtt tgacagccgc ctagcatttc 9360 
acttcaagaa ctgctgacat cgagcttgct 9420 
ggaggcgtgg cctgggcggg actggggagt 9480 
ctgctttttg cctgtactgg gtctctctgg 9540 
ggctaactag ggaacccact gcttaagcct 9600 
gtgtgtgccc gtctgttgtg tgactctggt 9660 
gtgtggaaaa tctctagca 9709 
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<210> 2 
<211> 854 
<212> PRT 

<213> Human immunodeficiency virus 
<220> 

<223> Envelope Polyprotein 
<400> 2 

Met Arg Val Lys Glu Lys Tyr Gin His Leu Trp Arg Trp Gly Trp Lys 
15 10 15 

Trp Gly Thr Met Leu Leu Gly lie Leu Met lie Cys Ser Ala Thr Glu 
20 25 30 

Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala 
35 40 45 

Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu 
50 55 60 

Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn 
65 70 75 80 

Pro Gin Glu Val Val Leu Val Asn Val Thr Glu Asn Phe Asn Met Trp 
85 90 95 

Lys Asn Asp Met Val Glu Gin Met His Glu Asp lie lie Ser Leu Trp 
100 105 110 

Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Ser 
115 120 125 

Leu Lys Cys Thr Asp Leu Lys Asn Asp Thr Asn Thr Asn Ser Ser Ser 
130 135 140 

Gly Arg Met lie Met Glu Lys Gly Glu lie Lys Asn Cys Ser Phe Asn 
145 150 155 160 

lie Ser Thr Ser lie Arg Asp Lys Val Gin Lys Glu Tyr Ala Phe Phe 
165 170 175 

Tyr Lys Leu Asp lie Val Pro lie Asp Asn Thr Ser Tyr Arg Leu lie 
180 . 185 190 

Ser Cys Asn Thr Ser Val lie Thr Gin Ala Cys Pro Lys Val Ser Phe 
195 200 205 

Glu Pro lie Pro lie His Tyr Cys Ala Pro Ala Gly Phe Ala lie Leu 
210 215 220 

Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr Asn Val 
225 230 235 240 

Ser Thr Val Gin Cys Thr His Gly He Arg Pro Val Val Ser Thr Gin 
245 250 255 

Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Asp Val Val He Arg Ser 
260 265 270 

Ala Asn Phe Thr Asp Asn Ala Lys Thr He He Val Gin Leu Asn Thr 
275 280 285 
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Ser Val Glu lie Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser 
290 295 300 

lie Arg lie Gin Arg Gly Pro Gly Arg Ala Phe Val Thr lie Gly Lys 
305 310 315 320 

lie Gly Asn Met Arg Gin Ala His Cys Asn lie Ser Arg Ala Lys Trp 
325 330 335 

Asn Ala Thr Leu Lys Gin lie Ala Ser Lys Leu Arg Glu Gin Phe Gly 
340 345 350 

Asn Asn Lys Thr lie lie Phe Lys Gin Ser Ser Gly Gly Asp Pro Glu 
355 360 365 

lie Val Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn 
370 375 380 

Ser Thr Gin Leu Phe Asn Ser Thr Trp Phe Asn Ser Thr Trp Ser Thr 
385 390 395 400 

Glu Gly Ser Asn Asn Thr Glu Gly Ser Asp Thr lie Thr Leu Pro Cys 
405 410 415 

Arg lie Lys Gin Phe lie Asn Met Trp Gin Glu Val Gly Lys Ala Met 
420 425 430 

Tyr Ala Pro Pro lie Ser Gly Gin lie Arg Cys Ser Ser Asn lie Thr 
435 440 445 

Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Asn Asn Asn Gly Ser Glu 
450 455 460 

lie Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu 
465 470 475 480 

Leu Tyr Lys Tyr Lys Val Val Lys lie Glu Pro Leu Gly Val Ala Pro 
485 490 495 

Thr Lys Ala Lys Arg Arg Val Val Gin Arg Glu Lys Arg Ala Val Gly 
500 505 510 

lie Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 
515 . 520 525 

Gly Cys Thr Ser Met Thr Leu Thr Val Gin Ala Arg Gin Leu Leu Ser 
530 535 540 

Asp lie Val Gin Gin Gin Asn Asn Leu Leu Arg Ala lie Glu Ala Gin 
545 550 555 560 

Gin His Leu Leu Gin Leu Thr Val Trp Gly lie Lys Gin Leu Gin Ala 
565 570 575 

Arg lie Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin Gin Leu Leu Gly 
580 585 590 

lie Trp Gly Cys Ser Gly Lys Leu lie Cys Thr Thr Ala Val Pro Trp 
595 600 605 

Asn Ala Ser Trp Ser Asn Lys Ser Leu Glu Gin lie Trp Asn Asn Met 



610 



615 



620 
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* Thr Trp Met Glu Trp Asp Arg Glu lie Asn Asn Tyr Thr Ser Leu lie 
625 630 635 640 

His Ser Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin 
645 650 655 

Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn 
660 665 670 

lie Thr Asn Trp Leu Trp Tyr lie Lys Leu Phe lie Met lie Val Gly 
675 680 685 

Gly Leu Val Gly Leu Arg lie Val Phe Ala Val Leu Ser lie Val Asn 
690 695 700 

Arg Val Arg Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr His Leu Pro 
705 710 715 720 

lie Pro Arg Gly Pro Asp Arg Pro Glu Gly lie Glu Glu Glu Gly Gly 
725 730 735 

Glu Arg Asp Arg Asp Arg Ser lie Arg Leu Val Asn Gly Ser Leu Ala 
740 745 750 

Leu lie Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe Ser Tyr His Arg 
755 760 765 

Leu Arg Asp Leu Leu Leu lie Val Thr Arg lie Val Glu Leu Leu Gly 



Arg Arg Gly Trp Glu Ala Leu Lys Tyr Trp Trp Asn Leu Leu Gin Tyr 
785 790 795 800 

Trp Ser Gin Glu Leu Lys Asn Ser Ala Val Asn Leu Leu Asn Ala Thr 
805 810 815 

Ala lie Ala Val Ala Glu Gly Thr Asp Arg Val He Glu Val Leu Gin 
820 825 830 

Ala Ala Tyr Arg Ala He Arg His He Pro Arg Arg He Arg Gin Gly 
835 840 845 

Leu Glu Arg He Leu Leu 



850 



<210> 3 
<211> 107 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide for cloning 



770 



775 



780 



<400> 3 

aagatgtagt aattagatct gccaatttca cagacaatgc taaaaccata atagtacagc 60 
tgaacacatc gttagaaatt aattgtacaa gacccaacaa caataca 107 




- 7 - 



*<210> 4 
<211> 120 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide for cloning 

<220> 

<221> misc_feature 
<222> (97).. (99) 

<223> Sequence at this position: (GA) (AT ) (GATC ) , le . 
base at position 97 can be G or A, base at 
position 98 can be A or T, and base at 
position 99 can be G, A, T or C. 



<400> 4 

ttttgctcta gaaatgttac aatgtgcttg tcttatgtct cctgttgcag cttctgttgc 60 
atgaaatgct ctccctggtc cgatatggat actatgrwnt tttcttgtat tgttgttggg 120 



<210> 5 
<211> 17 
I"* <212> DNA 

i,y <213> Artificial Sequence 

<220> 

<223> Description of the artificial sequence: 
y3 sequencing primer 

^ <400> 5 

ccatgtacaa atgtcag 17 



<210> 6 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence: 
sequencing primer 

<400> 6 

aaaactgtgc gttacaa 



<210> 7 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence: 
sequencing primer 



<400> 7 
gtaaaacgac 



ggccagt 



17 



- 8 - 

<210> 8 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence: 
sequencing primer 

<400> 8 

caggaaacag ctatgac 17 




;J3 



<210> 9 
<211> 2148 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence: synthetic DNA 
<220> 

<221> misc__f eature 

<222> (3) . . (9) 

<223> BstEII cleavage site 

<220> 

<221> misc_feature 
<222> (2143) . . (2148) 
<223> BamHI cleavage site 

<400> 9 

tgggtcaccg tctattatgg ggtgcctgtg tggaaggaag caaccaccac tctattttgt 60 

gcatcagatg ctaaagcata tgatacagag gtacataatg tttgggccac acatgcctgt 120 

gtacccacag accccaaccc acaagaagta gtattggtaa atgtgacaga aaattttaac 180 

atgtggaaaa atgacatggt agaacagatg catgaggata taatcagttt atgggatcaa 240 

agccttaagc catgtgtaaa attaacccca ctctgtgtta gtttaaagtg cactgatttg 300 

aagaatgata ctaataccaa tagtagtagc gggagaatga taatggagaa aggagagata 360 

aaaaactgca gcttcaatat cagcacaagc ataagagata aggtgcagaa agaatatgca 420 

ttcttttata aacttgatat agtaccaata gataatacca gctataggtt gataagttgt 480 

aacacctcag tgatcacaca ggcctgtcca aaggtatcct ttgagccaat tcccatacat 540 

tattgtgccc cggctggttt tgcgattcta aaatgtaata ataagacgtt caatggaaca 600 

ggaccatgta caaatgtcag cacagtacaa tgtacacatg gaattcgacc agtagtatca 660 

actcaactgc tgttaaatgg cagtctagca gaagaagatg tagtaattag atctgccaat 720 

ttcacagaca atgctaaaac cataatagta cagctgaaca catctgtaga aattaattgt 780 

acaagaccca acaacaatac aagaaaaagt atccgtatcc agaggggacc agggagagca 840 

tttgttacaa taggaaaaat aggaaatatg agacaagcac attgtaacat ttctagagca 900 

aaatggaatg ccactttaaa acagatagct agcaaattaa gagaacaatt tggaaataat 960 

aaaacaataa tctttaagca gtcatccgga ggggacccag aaattgtaac gcacagtttt 1020 

aattgtggag gggaattttt ctactgtaat tcaacacaac tgtttaatag tacttggttt 1080 

aatagtactt ggagtactga agggtcaaat aacactgaag gaagtgacac aatcacactc 1140 

ccatgcagaa taaaacaatt tataaacatg tggcaggaag taggaaaagc aatgtatgcc 1200 

cctcccatca gtggccaaat tagatgttca tcaaatatta ctgggctgct attaactcga 1260 

gatggtggta ataacaacaa tgggtccgag attttcagac ctggaggagg cgatatgagg 1320 

gataattgga gaagtgaatt atataaatat aaagtagtaa aaattgaacc attaggagta 1380 

gcacccacca aggcaaagag acgcgtggtg cagagagaaa agcgcgcagt gggaatagga 1440 

gctctgttcc ttgggttctt gggagcagca ggaagcacta tgggcgcagc gtcaatgacg 1500 

ctgacggtac aggccagaca attattgtct gatatagtgc agcagcagaa caatttgctg 1560 

agggcaattg aggcgcaaca gcatctgttg caactcacag tctggggcat caaacagctc 1620 

caggcaagaa tcctggctgt ggaaagatac ctaaaggatc aacagctcct ggggatttgg 1680. 

ggttgctctg gaaaactcat ttgcaccact gctgtgcctt ggaatgctag ttggagtaat 1740 

aaatctctgg aacagatttg gaataacatg acctggatgg agtgggacag agaaattaac 1800 

aattacacaa gcttaataca ctccttaatt gaagaatcgc aaaaccagca agaaaagaat 1860 

gaacaagaat tattggaatt agataaatgg gcaagtttgt ggaattggtt taacataaca 1920 

aattggctgt ggtatataaa attattcata atgatagtag gaggcttggt aggtttaaga 1980 

atagtttttg ctgtactttc tatagtgaat agagttaggc agggatattc accattatcg 2040 

tttcagaccc acctcccaat cccgagggga cccgacaggc ccgaaggaat agaagaagaa 2100 



ggtggagaga gagacagaga cagatccatt cgattagtga acggatcc 



2148 



<210> 10 
<211> 6229 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of the artificial sequence: synthetic DNA 
<220> 

<221> sig_peptide 
<222> (1293) . . (1295) 
<223> env ATG 



<220> 

<221> misc_feature 
<222> (1377) . . (1379) 
<223> env AGT , gpl20 start 

<220> 

<221> misc — feature 
<222> (1397) (1403) 
<223> BstEII cleavage site 

<220> 

<221> misc_f eature 
<222> (3537) . . (3542) 
<223> BamHI cleavage site 

<220> 

<221> misc_f eature 
<222> (3855) . . (3857) 
<223> env TAA , stop 



<400> 10 

ctgacgcgcc 

ccgctacact 

ccacgttcgc 

ttagtgcttt 

ggccatcgcc 

gtggactctt 

tataagggat 

ttaacgcgaa 

caactgttgg 

gggatgtgct 

taaaacgacg 

catagcccat 

ccgcccaacg 

atagggactt 

gtacatcaag 

cccgcctggc 

tacgtattag 

ggatagcggt 

ttgttttggc 

acgcaaatgg 

aaccgtcaga 

gacaattcga 

tgtggagatg 

c tacagaaaa 

ccac tc tatt 

ccacacatgc 

cagaaaattt 

gtttatggga 

agtgcactga 

agaaaggaga 



ctgtagcggc 
tgccagcgcc 
cggc tttccc 
acggcacctc 
ctgatagacg 
gttccaaact 
tttgccgatt 
ttttaacaaa 
gaagggcgat 
gcaaggcgat 
gccagtgagc 
atatggagtt 
acccccgccc 
tccattgacg 
tgtatcatat 
attatgccca 
tcatcgctat 
ttgactcacg 
accaaaatca 
gcggtaggcg 
tcgcctggag 
gctcggtacc 
ggggtggaaa 
attgtgggtc 
ttgtgcatca 
ctgtgtaccc 
taacatgtgg 
tcaaagcc ta 
tttgaagaat 
gataaaaaac 



gcattaagcg 
ctagcgcccg 
cgtcaagctc 
gaccccaaaa 
gtttttcgcc 
ggaacaacac 
tcggcc tatt 
atattaacgc 
cggtgcgggc 
taagttgggt 
gtctagttat 
ccgcgttaca 
attgacgtca 
tcaatgggtg 
gccaagtacg 
gtacatgacc 
taccatggtg 
gggatttcca 
acgggacttt 
tgtacggtgg 
acgccatcca 
gtcgacgcca 
tggggcacca 
accgtc tatt 
gatgctaaag 
acagacccca 
aaaaatgaca 
aagccatgtg 
gatactaata 
tgc tctttca 



cggcgggtgt 
ctcctttcgc 
taaatcgggg 
aacttgatta 
ctttgacgtt 
tcaaccctat 
ggttaaaaaa 
ttacaatttc 
ctcttcgcta 
aacgccaggg 
taatagtaat 
taacttacgg 
ataatgacgt 
gagtatttac 
ccccctattg 
ttatgggact 
atgcggtttt 
agtctccacc 
ccaaaatgtc 
gaggtctata 
cgctgttttg 
ccatgagagt 
tgctccttgg 
atggggtacc 
catatgatac 
acccacaaga 
tggtagaaca 
taaaattaac 
ccaatagtag 
atatcagcac 



ggtggttacg 
tttcttccc t 
gc tcccttta 
gggtgatggt 
ggagtccacg 
ctcggtctat 
tgagctgatt 
cattcgccat 
ttacgccagc 
ttttcccagt 
caattacggg 
taaatggccc 
atgttcccat 
ggtaaactgc 
acgtcaatga 
ttcc tac ttg 
ggcagtacat 
ccattgacgt 
gtaacaactc 
taagcagagc 
acctccatag 
gaaggagaag 
gatattgatg 
tgtgtggaag 
agaggtacat 
agtagtattg 
gatgcatgag 
cccactc tgt 
tagcgggaga 
aagcataaga 



cgcagcgtga 
tcctttctcg 
gggttccgat 
tcacgtagtg 
ttctttaata 
tcttttgatt 
taacaaaaat 
tcaggctgcg 
tggcgaaagg 
cacgacgttg 
gtcattagtt 
gcctggctga 
agtaacgcca 
ccac ttggca 
cggtaaatgg 
gcagtacatc 
caatgggcgt 
caatgggagt 
cgccccattg 
tcgtttagtg 
aagacaccgg 
tatcagcact 
atctgtagtg 
gaagcaacca 
aatgtttggg 
gtaaatgtga 
gatataatca 
gt tagtttaa 
atgataatgg 
gataaggtgc 



60 

120 

180 • 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800' 



agaaagaata tgcattcttt 
ggttgataag ttgtaacacc 
caattcccat acattattgt 
cgttcaatgg aacaggacca 
ggccagtagt atcaactcaa 
ttagatctgc caatttcaca 
tagaaattaa ttgtacaaga 
gaccagggag agcatttgtt 
acattagtag agcaaaatgg 
aatttggaaa taataaaaca 
taacgcacag ttttaattgt 
atagtacttg gtttaatagt 
acacaatcac actcccatgc 
aagcaatgta tgcccctccc 
tgctattaac aagagatggt 
gaggcgatat gagggacaat 
aaccattagg agtagcaccc 
cagtgggaat aggagctttg 
gcacgtcaat gacgctgacg 
agaacaattt gctgagggct 
gcatcaaaca. gctccaggca 
tcctggggat ttggggttgc 
ctagttggag taataaatct 
acagagaaat taacaattac 
agcaagaaaa gaatgaacaa 
ggtttaacat aacaaattgg 
tggtaggttt aagaatagtt 
attcaccatt atcgtttcag 
gaatagaaga agaaggtgga 
ccttagcact tatctgggac 
gagacttact cttgattgta 
ccctcaaata ttggtggaat 
ttaacttgct caatgccaca 
tattacaagc agcttataga 
aaaggatttt gc tataagat 
gctgtaaggg aaagaatgag 
c gaga tc tag ac tagaacta 
acaaaccaca actagaatgc 
tgctttattt gtaaccatta 
ttttatgttt caggttcagg 
caaatgtggt atggctgatt 
ctctgacaca tgcagctccc 
agacaagccc gtcagggcgc 
cagtcacgta gcgatagcgg 
tactgagagt gcaccatatg 
cctgacgagc atcacaaaaa 
taaagatacc aggcgtttcc 
ccgc ttaccg gatacctgtc 
tcacgctgta ggtatctcag 
gaaccccccg ttcagcccga 
ccggtaagac acgacttatc 
aggtatgtag gcggtgctac 
aggacagtat ttggtatctg 
age tc ttgat ccggcaaaca 
cagattaege gcagaaaaaa 
gaegctcagt ggaacgaaaa 
atcttcacct agatcctttt 
gagtaaactt ggtctgacag 
tgtctatttc gttcatccat 
gagggcttac catctggccc 
ccagatttat cagcaataaa 
actttatccg cctccatcca 
ccagttaata gtttgcgcaa 
tcgtttggta tggcttcatt 
cccatgttgt gcaaaaaagc 
ttggccgcag tgttatcact 
ccatccgtaa gatgetttte 



- 10 - 

tataaacttg atatagtacc aatagataat accagctata 1860 
tcagtcatta cacaggcctg tccaaaggta tcctttgagc 1920 
gccccggctg gttttgegat tctaaaatgt aataataaga 1980 
tgtacaaatg tcagcacagt acaatgtaca catggaatca 2040 
ctgctgttaa atggcagtct agcagaagaa gatgtagtaa 2100 
gaeaatgeta aaaccataat agtacagctg aacacatctg 2160 
cccaacaaca atacaagaaa aagtatccgt atccagaggg 2220 
acaataggaa aaataggaaa tatgagacaa gcacattgta 2280 
aatgecaett taaaacagat agctagcaaa ttaagagaac 2340 
ataatcttta agcaatcctc aggaggggac ccagaaattg 2400 
ggaggggaat ttttctactg taattcaaca caactgttta 2460 
acttggagta ctgaagggtc aaataacact gaaggaagtg 2520 
agaataaaac aatttataaa catgtggcag gaagtaggaa 2580 
atcagtggac aaattagatg ttcatcaaat attactgggc 2640 
ggtaataaca acaatgggtc cgagatcttc agacctggag 2700 
tggagaagtg aattatataa atataaagta gtaaaaattg 2760 
accaaggcaa agagaagagt ggtgcagaga gaaaaaagag 2820 
ttccttgggt tcttgggagc agcaggaagc actatgggct 2880 
gtacaggeca gacaattatt gtctgatata gtgeagcage 2940 
attgaggege aacagcatct gttgeaaetc acagtctggg 3000' 
agaatcctgg ctgtggaaag atacctaaag gatcaacagc 3060 
tctggaaaac teatttgeae cactgctgtg ccttggaatg 3120 
ctggaacaga tttggaataa catgacctgg atggagtggg 3180 
acaagcttaa tacactcctt aattgaagaa tcgcaaaacc 3240 
gaattattgg aattagataa atgggcaagt ttgtggaatt 3300 
ctgtggtata taaaattatt cataatgata gtaggaggct 3360 
tttgctgtac tttctatagt gaatagagtt aggcagggat 3420 
acccacctcc caatcccgag gggacccgac aggeccgaag 3480 
gagagagaca gagacagatc cattcgatta gtgaaeggat 3540 
gatctgegga gcctgtgcct cttcagctac caccgcttga 3600 
acgaggattg tggaacttct gggaegcagg gggtgggaag 3660 
ctcctacagt attggagtca ggaactaaag aatagtgctg 3720 
gecatagcag tagctgaggg gacagatagg gttatagaag 3780 
gctattcgcc acatacctag aagaataaga cagggcttgg 3840 
gggtggcaag tggtcaaaaa gtagtgtgat tggatggcct 3900 
acgagctgag ccagcagcag atggggtggg agcagtatct 3960 
gcttcgatcc agacatgata agatacattg atgagtttgg 4020 
agtgaaaaaa atgetttatt tgtgaaattt gtgatgetat 4080 
taagctgcaa taaacaagtt aacaacaaca attgeattea 4140 
gggaggtgtg ggaggttttt taaagcaagt aaaacctcta 4200 
atgatcctgc ctcgcgcgtt tcggtgatga eggtgaaaac 4260 
ggagacggtc acagcttgtc tgtaagcgga tgeegggage 4320 
gtcagcgggn gttggcgggt gteggggege agccatgacc 4380 
agtgtatact ggcttaacta tgeggcatea gagcagattg 4440 
tcgggccgcg ttgctggcgt ttttccatag gctccgcccc 4500 
tcgacgctca agtcagaggt ggcgaaaccc gacaggacta 4560. 
ccctggaagc tccctcgtgc gctctcctgt tccgaccctg 4620 
cgcctttctc ecttegggaa gcgtggcgct ttctcatagc 4680 
ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac 4740 
ccgctgcgcc ttatccggta actategtet tgagtccaac 4800 
gccactggca gcagccactg gtaacaggat tagcagagcg 4860 
agagttcttg aagtggtggc etaactaegg ctacactaga 4920 
cgctctgctg aagccagtta ectteggaaa aagagttggt 4980 
aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag 5040 
aggatctcaa gaagatcctt tgatcttttc taeggggtet 5100 
ctcacgttaa gggattttgg tcatgagatt atcaaaaagg 5160 
aaattaaaaa tgaagtttta aatcaatcta aagtatatat 5220 
ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc 5280 
agttgcctga ctccccgtcg tgtagataac tacgataegg 5340 
cagtgetgea atgatacege gagacccacg ctcaccggct 5400 
ccagccagcc ggaagggccg agegcagaag tggtcctgea 5460 
gtctattaat tgttgccggg aagctagagt aagtagttcg 5520 
cgttgttgcc attgetacag gcatcgtggt gtcacgctcg 5580 
cagctccggt tcccaacgat caaggegagt tacatgatcc 5640 
ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag 5700 
catggttatg gcagcactgc ataattctct tactgtcatg 5760 
tgtgactggt gagtactcaa ccaagtcatt ctgagaatag 5820 



- 11 - 



tgtatgcggc gaccgagttg ctcttgcccg gcgtcaatac gggataatac cgcgccacat 5880 

agcagaactt taaaagtgct catcattgga aaacgttctt cggggcgaaa actctcaagg 5940 

atcttaccgc tgttgagatc cagttcgatg taacccactc gtgcacccaa ctgatcttca 6000 

gcatctttta ctttcaccag cgtttctggg tgagcaaaaa caggaaggca aaatgccgca 6060 

aaaaagggaa taagggcgac acggaaatgt tgaatactca tactcttcct ttttcaatat 6120 

tattgaagca tttatcaggg ttattgtctc atgagcggat acatatttga atgtatttag 6180 

aaaaataaac aaataggggt tccgcgcaca tttccccgaa aagtgccac 6229 



<210> 11 
<211> 860 
<212> DNA 

<213> Human immunodeficiency virus 
<220> 

<221> misc_f eature 
<222> (1) . . (860) 

<223> PI-932 original sequence Vl-V2-V3-loop 



<400> 11 

tgtgtaccca 

aacatgtgga 

caaagcctaa 

gctgatttaa 

actcataaca 

ttcaaggtca 

cttgatatag 

aacacctcga 

ttt tgtgccc 

ggaccatgta 

ac tcaactgc 

ttcacagaca 

ataagacccc 

acaacaagaa 

tggaataaca 



cagaccccaa 
aaaatgacat 
agccatgtgt 
attgcaataa 
atactattag 
ccacaggcat 
taccaataga 
ccattacaca 
cggctggttt 
aaaatgtcag 
tgttgaatgg 
atgc taaaac 
acaacac tgt 
aaataaaagg 
c t t tacaaga 



cccacaaaag 
ggtagaacag 
aaaactaacc 
tac tgattta 
tgggaaaata 
aagggataag 
taatgataag 
ggcctgtcca 
tgcgattcta 
cacagtacaa 
cagtctagca 
cataatagta 
aacagacagg 
agatataaga 



gtagtattgg 
atgcatgagg 
ccactc tgtg 
aattgcacta 
atagagaaag 
atgcaaaaag 
aataatac ta 
aaggtatcct 
aagtgtaatg 
tgtacacatg 
gaaaaagaag 
cagctgaagg 
atacatatag 
caagcacatt 



aaaatgtgac 
atataatcaa 
ttactttaaa 
aagc taattt 
tagaaataaa 
aatatgcac t 
ac tttatatt 
ttgagccaat 
aaaagagtta 
gaattaggcc 
tagtaattag 
aatctgtaaa 
ggccagggag 
gtagccttag 



agaaaatttt 
tttatgggat 
ttgcactgat 
ggggaaaaat 
aaac tgc tct 
tttgaataaa 
gataagttgt 
tcccatacat 
cagtggaaaa 
agtagtgtca 
ate tgagaat 
cattac ttgt 
atcatttcat 
gagaaaagat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

860 



<210> 12 
<211> 870 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the ..artificial sequence: PI-932 

gene cassette, comprising the cleavage sites for 
restriction enzymes BspTl, PstI, Bell, EcoRI, 
Bglll, PvuII, Xball, Nhel 



<400> 12 

tgtgtaccca 

aacatgtgga 

caaagee tta 

gctgatttaa 

actcataact 

ttcaaggtca 

cttgatatag 

aacacc tegg 

ttttgtgccc 

ggaccatgta 

ac tcaac tgc 

ttcacagaca 

ataagacccc 

acaacaagaa 

tggaataaca 



caga 
aaaa 
agee 
attg 
gcag 
ccac 
tacc 
tgat 
egge 
aaaa 
tgtt 
atgc 
acaa 
aaat 
cttt 



ccccaa 
tgacat 
atgtgt 
caataa 
tat tag 
aggcat 
aataga 
cacaca 
tggttt 
tgtcag 
gaatgg 
taaaac 
cac tgt 
aaaagg 
acaaga 



cccacaaaag 
ggtagaacag 
aaaactaacc 
tactgatt ta 
tgggaaaata 
aagggataag 
taatgataag 
ggcctgtcca 
tgegatte ta 
cacagtacaa 
cagtctagca 
cataatagta 
aacagacagg 
agatataaga 
gatagctagc 



gtagtattgg 
atgcatgagg 
ccac tc tgtg 
aattgcacta 
atagagaaag 
atgcaaaaag 
aataatac ta 
aaggtatcct 
aagtgtaatg 
tgtacacatg 
gaaaaagaag 
cagctgaagg 
atacatatag 
caagcacatt 



aaaatgtgac 
atataatcaa 
ttactttaaa 
aagctaat tt 
tagaaataaa 
aatatgcac t 
actttatatt 
ttgagccaat 
aaaagagtta 
gaatteggee 
tagtaattag 
aatctgtaaa 
ggccagggag 
gtagecttte 



agaaaatttt 
tttatgggat 
ttgeae tgat 
ggggaaaaat 
aaac tgc tct 
1 1 tgaataaa 
gataagttgt 
tcccatacat 
cagtggaaaa 
agtagtgtca 
atctgagaat 
cattac ttgt 
atcatttcat 
tagaaaagat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

870 



